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REMARKS 

Claims 4, 31-32, 37-45 have been withdrawn from consideration by the examiner as 
being directed to a non-elected invention. Claims 1-2, 6-14, 24, 30, 33 and 37-45 are cancelled 
without prejudice. Claims 3, 5, 15, 16, 18, 20, 25, 26, 28, 29, 32, 34-36 and 46-60 have been 
amended to more clearly define the invention. No new matter is added. 

Specification 

The specification has been amended to include SEQ ID Nos. in order to comply 
with 37 C.F.R. § 1 .825. The sequence listing has also been amended so that the SEQ ID NO:s 
listed in the specification correspond to those in the sequence listing. In particular, the promoter 
of the elected invention is Mprl217 (SEQ ID NO:22). The sequence listing filed previously 
lists a short primer sequence for SEQ ID NO:22, and listed the sequence of MPR1217 under 
SEQ ID NO:37. The attached sequence listing now lists SEQ ID NO:22 as having the sequence 
of Mprl217, and not SEQ ID NO:37. Similarly, the attached sequence listing now lists SEQ ID 
NO:21 as having the sequence of Mprl216, and not SEQ ID NO: 17. These corrections do not 
entail new matter because the sequence of the promoter Mprl217 had been listed as SEQ ID 
NO:37, and because the specification discloses the exact steps used to derive the Mpr promoter 
1217 from the high molecular weight glutenin gene (PrHMWG-Dx5 (SEQ ID NO: 1)), which 
was well known to the public at the time the invention was made (Accession number XI 2928; 
see page 20 of the instant specification). 

In addition, the sequence listed on page 42, lines 26-27, has been designated as SEQ ID 
NO:38. SEQ ID NO:38 has been newly added to the sequence listing. Accordingly, attached 
herewith is a copy of the "Sequence Listing" (sheets 1/1 8 through 18/18) in paper form for the 
above-identified patent application as required by 37 C.F.R. §1.82 1(c) and a copy of the 
Sequence Listing in computer readable form as required by 37 C.F.R. §1.82 1(e). As required by 
37 C.F.R. §1. 821(f), Applicant's Attorney hereby states that the content of the "Sequence 
Listing" in paper form and the computer readable form of the "Sequence Listing" are the same 
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and 5 as required by 37 C.F.R. § 1.821(g), also states that the submission includes no new matter. 
Applicants request the entry of the Sequence Listing in the above-captioned patent application to 
replace the existing sequence listing. 

The specification has also been amended to insert the phrase "What is claimed is" at the 
top of page 71. 

Restriction/election 

The elected invention is drawn to the MPrl217 promoter (SEQ ID NO:22). Applicant 
also respectfully requests rejoinder of the MPrl216 promoter (SEQ ID NO:21) and the MPrl 128 
promoter (SEQ ID NO:4) into the elected group because a search of (SEQ ID NO:4) would 
necessarily result in the identification of SEQ ID NO:21 and SEQ ID NO:22. Figure 1 shows that 
the MPrl217 promoter (SEQ ID NO:22), the MPrl216 promoter (SEQ ID NO:21), and the 
MPrl 128 promoter (SEQ ED NO:4) comprise three, two or one, respectively, copies of a 
sequence comprising a G Box positioned upstream of an enhancer element. Applicant 
respectfully submits that these three promoters are structurally different aspects of the same 
invention, and would not impose an undue search burden on the Examiner. Accordingly, 
Applicant respectfully requests rejoinder of Group 4, claims 2, 3, 5, 8-30, 32, 34-36, 46-61 with 
respect to SEQ ID NO:4, and Group 21, claims 2, 3, 5, 8-30, 32, 34-36, 46-61 with respect to 
SEQ ID NO:21, to the examination of elected Group 22, claims 2, 3, 5, 8-30, 32, 34-36, 46-61 
with respect to SEQ ID NO:22, and the linking claims 1, 6 and 7. 



Claim Objections 

Claims 5, 29, 32, 34-36 and 46-61 have been amended to delete SEQ ID Nos. that refer to 
non-elected subject matter. 

The office action states that Claim 57 and its dependent claims are multiple dependent 
claims. However, it was not intended that claim 57 be a multiple dependent claim. The 
language of Claim 57 has been amended to clarify that it is an independent claim, and therefore 
is not a multiple dependent claim. Claims 59 and 61 each depend on only one independent claim 
(claim 57). 
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Claim Rejections - 35 USC 112 

Claims 1, 3, 5-6, 8, 10, 12-14, 16-18, 20, 23-26, 29-30, 32, 34-36 and 45-56 are asserted 
to be indefinite because they fail to begin with an article, such as "A" or "The". The pending 
claims have been amended so they each begin with an article. 

Claim 5 is asserted to be indefinite because it recites "SEQ ID NO:8" twice. Both of the 
recitations of the phrase "SEQ ID NO:8" have been deleted from Claim 5. 

Claims 5, 29, 32 and 34 are asserted to be indefinite because they recite multiple periods 
in the claims. Accordingly, the recitation of multiple periods in Claims 5, 29, 32 and 34 has been 
deleted. 

Claim 46 is asserted to be indefinite because it recites the term "uence". Accordingly, the 
recitation of the term "uence" has been replaced with the term "sequence". 

Claims 52-56, 58 and 60 are asserted to be indefinite because the claims refer to the 
method of claim 48 or 49, which are not methods. Accordingly, the recitation of "claim 49" has 
been deleted from claims 52-55, and the recitation of "claim 48" has been deleted from claim 56. 
The dependency of claims 58 and 60 have been changed from being dependent on claim 56 to 
being dependent on claim 57, the incorrect dependency being a result of typographical errors. 

Claim Rejections - 35 USC 102 

Claims 1-2, 6-28, 30, 35-36, 46-56, 58 and 60 are rejected under 35 U.S.C. 102(b) as 
being anticipated by Thomas et al. 

Applicants submit that for a determination of anticipation to be proper, the prior art 
reference must disclose each and every limitation of the claim. Atlas Powder Company et al v. 
IRECO, Incorporated et al, 190 F.3d 1342, 1347 (Fed. Cir. 1999). 

Applicant submits that the cited reference does not teach a chimeric promoter that 
comprises an AS2 box, and that accordingly the cited reference does not anticipate the instant 
claims which recite a chimeric promoter that comprise an AS2 box. 

The office action describes the cited reference as teaching a chimeric promoter which 
"includes sequences from the CaMV promoter, which inherently contains an asl (conferring root 
specific expression) and as2 box (conferring photosynthetic tissue expression)..." . 
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"An c asT and an 'as2' element are being interpreted as being present in any chimeric 
promoter that includes the CaMV promoter, as both of these sequences originate from the 
CaMV promoter". 

Applicants respectfully disagree with said assertion, on the grounds that the chimeric 
promoter taught by Thomas et al. does not contain the as2 element. 

Applicants first note that the chimeric promoter taught by Thomas et al. includes only a 
part of the CaMV promoter. Thomas et al teach that the CaMV "promoter carries a 5' deletion 
termination 89 bp upstream from the transcription initiation site and, therefore, retains the 
TATA and CCAAT-like boxes but not the putative enhancer sequences", see column 1 of page 
1172. 

Applicants also note that in the CaMV promoter, the asl box extends from -82 to -64, 
while the as2 box is located at -85 to -105 (see page 1056, lines 1-15 of column 1 of an article 
by Verdaguer et al. {Plant Molecular Biology 37:1055-1067; 1998), Appendix A). 

Because the sequences upstream of -89 of the CaMV promoter are not present in the 
referenced chimeric promoter, the as2 element which is located at -85 to -105, is also not 
present in the chimeric promoter taught by Thomas et al. Therefore, the chimeric promoter 
taught by Thomas et al. comprises a truncated CaMV promoter which contains the aspl box, but 
not the asp2 box. 

Because Thomas et al. does not teach a chimeric promoter comprising an as2 box, as 
required by claims 20-23, 25-28, Applicant respectfully requests reconsideration and withdrawal 
of the 102 rejection of claims 20-23 and 25-28. 

Applicant contends that Thomas et al. does not teach a chimeric promoter with all the 
recited limitations of Claim 15, which is drawn to a chimeric promoter of gene expression 
comprising at least one transcriptional regulatory sequence from a gene encoding a high 
molecular weight wheat glutenin, which comprises a TATA box, a transcription start site, at least 
one enhancer box upstream of said TATA box and said transcription start site (+1), and at least 
one GATA box upstream of said at least one enhancer box, wherein said GATA box confers 
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light-regulatable expression on a transcription unit operably linked to said promoter. 
Specifically, the office action does not specify, nor can applicants find, where Thomas et al, 
teaches a chimeric promoter having a GATA box which confers light-regulatable expression on 
a transcription unit operably linked to said promoter. Accordingly, Applicant respectfully 
requests reconsideration and withdrawal of the 102 rejection of claims 15 . 

Applicant contends that Thomas et al. does not teach a chimeric promoter with all the 
recited limitations of Claim 16, which is drawn to a chimeric promoter of gene expression 
comprising at least one transcriptional regulatory sequence from a gene encoding a high 
molecular weight wheat glutenin, which comprises a TATA box, a transcription start site, at least 
one enhancer box upstream of said TATA box and said transcription start site (+1), and 
comprising at least one cereal box upstream of the enhancer box. Applicant contends that the 
phrase "cereal box" is an art recognized term, which is defined as a motif containing a GCN4- 
like motif (GLM), (see column 2, page 71 1 of Guillaumie et al. Genome 47:705-713 (2004) 
Appendix "B"). Specifically, the office action does not specify where Thomas et al teaches a 
chimeric promoter which comprises a cereal box as required by claim 16. Further, the Office 
action does not specify where the Thomas et al. reference teaches (1) a chimeric promoter which 
comprises a cereal box that confers seed-specific expression on a transcription unit operably 
linked to the promoter, as required by claim 17; (2) a chimeric promoter which comprises two 
cereal boxes upstream of an enhancer box wherein no transcriptional regulatory sequences are 
between said cereal boxes, as required by claim 18; or (3) a chimeric promoter comprising two 
cereal boxes upstream of at least one enhancer box, wherein the cereal boxes are contiguous, as 
required by Claim 19. Accordingly Applicant respectfully requests reconsideration and 
withdrawal of the 102 rejection of claims 16-19. 

Applicant contends that Thomas et al. does not teach a chimeric promoter, a cell 
comprising the promoter, nor a method for obtaining the cell, wherein the chimeric promoter 
contains all the recited limitations of the chimeric promoter recited in Claims 5, 29, 32, 34-36, 
46-49, and 57-60. Claims 5, 29, 32, 34-36, 46-49, and 57-60 comprising a sequence selected 
from the group consisting of SEQ ID NO:22, SEQ ID NO:21 and SEQ ID NO:4, and these 
sequences are not taught by Thomas et al. Accordingly, Applicant respectfully requests 



Attorney Docket No.: 18433/2022 

-30- 

reconsideration and withdrawal of the 102 rejection of Claims 5, 29, 32, 34-36, 46-49, and 57- 
60. 

Applicant contends that Thomas et al. does not teach a method for expressing a nucleic 
acid sequence encoding a polypeptide in a cell comprising a vector encoding a chimeric 
promoter comprising a sequence selected from the group consisting of SEQ ID NO:22, SEQ ED 
NO:21 and SEQ ID NO:4, as required by claim 50-56, because these sequences are not taught by 
Thomas et al. Accordingly, Applicant respectfully requests reconsideration and withdrawal of 
the 102 rejection of claims 50-56. 

Claims Rejections - 35 USC 103 

Claim 3 is rejected under USC 103(a) as being unpatentable over Thomas (as set forth in 
the rejection of claims 1, 2, 6-28, 30, 35, 36, 46-56, 58 and 60 under 35 USC(b)) in view of 
Halford et al. 

To establish a prima facie case of obviousness, the Federal Circuit has stated in In re 
Geiger (815 F.2d 686, 688, 2 U.S.P.Q.2d 1276, 1278 (Fed. Cir. 1987) that cc [o]bviousness cannot 
be established by combining the teachings of the prior art to produce the claimed invention, 
absent some teaching[,] suggestion or incentive supporting the combination." 

Applicant traverses the rejection on the grounds that a teaching, suggestion or incentive 
supporting the combination of the cited references is absent in the 103 rejection. 

Applicants agree with the Office Action's statement that Halford and Thomas both teach 
the expression of heterologous sequences using promoters derived from HMWG genes, and that 
Thomas et al. does not teach the use of the Glu-lDx5 promoter. The Office action also states 
that Halford suggests "that the HMW regulatory sequences may be useful for driving high levels 
of expression [of foreign genes] in tobacco". However, Applicant notes that the Halford 
reference does not teach or suggest the use of chimeric promoters as required by claim 3. 

The Office Action further states that "one of skill in the art would have been motivated to 
substitute the Glu-lDx5 promoter taught by [Halford for the ] Glu-Dl-2 promoter taught by 
Thomas because Halford suggests using their particular promoters for the enhanced expression 
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of heterologous genes in tobacco, which is where Thomas is expressing their heterologous 
gene". 

Applicant contends that the office action does not address where the Halford reference 
provides motivation for substituting the Glu-lDx5 promoter taught by Halford for the Glu-Dl-2 
promoter taught by Thomas in a chimeric promoter. Applicant respectfully submits that the 
motivation does not provide for combining the references to produce a chimeric promoter as 
required by the instant claim, but only provides motivation for producing a heterologous gene. 
Because the office action does not supply motivation for combining the references to produce the 
chimeric promoter of the instant invention, no prima facie case of obviousness has been 
established. Applicant respectfully requests reconsideration and withdrawal of the rejection. 



Applicant submits that all claims are allowable as written and respectfully request early 
favorable action by the Examiner. If the Examiner believes that a telephone conversation with 
Applicant's attorney/agent would expedite prosecution of this application, the Examiner is 
cordially invited to call the undersigned attorney/agent of record. 



CONCLUSION 



Respectfully submitted, 
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Colocation between a gene encoding the bZip 
factor SPA and an eQTL for a high-molecular- 
weight glutenin subunit in wheat (Triticum 
aestivum) 

Sabine Guillaumie, Gilles Charmet, Laurent Linossier r Valerie Torney, 
Nathalie Robert, and Catherine Ravel 



Abstract: The quality of wheat grain is largely determined by the quantity and composition of storage proteins 
(prolamins) and depends on mechanisms underlying the regulation of expression of prolamin genes. The endosperm- 
specific wheat basic region leucine zipper (bZIP) factor storage protein activator (SPA) is a positive regulator that binds 
to the promoter of a prolamin gene. The aim of this study was to map SPA (the gene encoding bZIP factor SPA) and 
genomic regions associated with quantitative variations of storage protein fractions using F 7 recombinant inbred lines 
(RILs) derived from a cross between Triticum aestivum 'Renan* and T. aestivum 'Recital*. SPA was mapped through 
RFLP using a cDNA probe and a specific single nucleotide polymorphism (SNP) marker. Storage protein fractions in 
the parents and RILs were quantified using capillary electrophoresis. Quantitative trait loci (QTLs) for protein were de- 
tected and mapped on six chromosome regions. One QTL, located on the long arm of chromosome IB, explained 70% 
of the variation in quantity of the x subunit of Glu-Bl. Genetic mapping suggested that SPA is located on chromosome 
arm iL and is also present in the confidence interval of the corresponding QTL for Glu-Bl x on 1BL, suggesting that 
SPA might be a candidate gene for this QTL. 

Key words: Triticum aestivum, quantitative trait locus (QTL), single nucleotide polymorphism (SNP), storage protein 
activator (SPA), high-molecular-weight glutenin subunit (HMW-GS). 

Resume : La qualite du grain de ble" est largement d&erminee par la quantite et la composition en proteines de reserve 
(prolamines). Elle depend 6galement de mecanismes qui controlent la regulation de Texpression des genes codant pour 
les prolamines. Le facteur de transcription bZIP, specifique de T albumen, activateur pour les proteines de reserve (SPA) 
est un regulateur du promoteur d'un gene de protamine. SPA et les regions chromosomiques associ^es avec les varia- 
tions quantitatives de differentes fractions proteiques ont ete cartographies a I'aide d'une population F 7 dc lign^es re- 
combinantes obtenue a partir d'un croisement entre Triticum aestivum 'Renan' et T. aestivum 'Recital*. SPA a 6te* 
d'abord assigne sur des lignees aneuploides grace a une sonde d'ADNc. II a aussi ete cartographie* genetiquement a 
Paide d'un marqueur polymorphe base" sur une mutation ponctuelle (SNP). Les differentes fractions pour les prot&nes 
de reserve ont ete mesurees par electrophorese capillaire. Six loci a caractere quantitatif (QTLs) pour les proteines ont 
ete detectes. Le QTL pour la quantite de glutenines de haut poids moleculaire (HMW-GS) codee par le locus GIu-BJ-J 
explique 70 % de la variation phenotypique observee. L'assignement de SPA sur les lignees aneuploides reVele trois 
loci, situes sur le bras long des chromosomes home*ologues du groupe 1 . Le SNP d^veloppe* montre qu'un des genes de 
SPA co-localise avec le QTL pour la quantite de glutenines de haut poids moleculaire codee par le locus Glu-Bl-]. 
SPA serait done un gene candidat pour ce QTL. 

Mots cles : Triticum aestivum, locus a caractere quantitatif (QTL), polymorphisme base sur une mutation ponctuelle 
(SNP), activateur pour les proteines de reserve (SPA), glutenines de haut poids moleculaire (HMW-GS). 



Received 4 November 2003. Accepted 8 March 2004. Published on the NRC Research Press Web site at http://genome.nrc.ca on 
26 July 2004. 

Corresponding Editor: F. Belzile. 

S. Guillaumie, G. Charmet, V. Torney, N. Robert, 1 and C. Ravel. 2 Unite* mixte de recherches INRA University Blaise Pascal 
Amelioration et Sante* des Plantes, 234 Avenue du Br6zet, 63039 Clermont-Ferrand CEDEX 2, France. 
L. Linossier. ULICE, Zone d'Activite Commericale des portes de Riom, 63200 Riom CEDEX, France. 

'Present address: ISAB, rue Pierre Waguet, BP 30313, 60026 Beauvais CEDEX, France. 
Corresponding author (e-mail: ravel@clermont.inra.fr). 



Genome 47: 705-713 (2004) 



doi: 10.1I39/G04-031 



© 2004 NRC Canada 



706 



Genome Vol. 47, 2004 



Introduction 

The endosperm of cereal grains is a storage organ contain- 
ing proteins, oil, and carbohydrates, which are used by the 
embryo during germination. The most abundant wheat stor- 
age proteins exclusively synthesized in endosperm are 
prolamins, making up more than 90% of seed storage pro- 
teins. Wheat prolamins are divided into two groups: gliadins 
and glutenins. Gliadins consist of complex mixtures of 
monomeric proteins and are classified as a-, (i-, y-, and co- 
gliadins according to their electrophoretic mobility, whereas 
glutenins are formed by high-molecular-weight and low- 
molecular-weight subunits (HMW-GS and LMW-GS), ag- 
gregated into high-mass polymers held together by 
intermolecular disulfide bonds. Gliadins and glutenins are 
important for bread-making quality, since they are involved 
in gluten network formation (Branlard and Dardevet 1985; 
Metakovsky et al. 1997; Branlard et al. 2001). Gliadins en- 
hance gluten extensibility, whereas HMW-GS determine vis- 
cosity (strength) of the dough (Payne 1987; Shewry et al. 
1992). In hexaploid wheat, HMW-GS are encoded by three 
loci, Glu-AI, Glu-BU and Glu-DJ, located on homeologous 
group-1 chromosomes (Payne 1987). Each locus is made of 
two tightly linked genes, e.g., Glu-DI-1 and Glu-DI-2, 
which encode the x and y subunits, respectively. It should be 
noticed that Glu-Al-2 (always) and Glu-BJ-2 (in some geno- 
types) have null alleles, so that Glu-Bly is generally absent. 

The expression of seed storage protein genes is controlled 
by a network of regulatory proteins that interact with spe- 
cific DNA elements and modulate their transcription. 
Transcriptional factors binding to specific DNA motifs are 
involved in such a regulation, as shown in the cases of 
Opaque-2 (Motto et al. 1989) and prolamin box binding fac- 
tor (PBF) (Vicente-Carbajosa et al. 1997). In barley, four 
transcription factors involved in the expression of genes cod- 
ing for horde in seed storage proteins have been identified 
(Mena et al. 1998; Vicente-Carbajosa et al. 1998; Ofiate et 
al. 1999; Carbonero et al. 2000; Diaz et al. 2002). In wheat, 
Hammond-Kosack et al. (1993) demonstrated that two pro- 
tein factors, endosperm box factors I and II (ESBF-I and 
ESBF-II), could bind to the bifactorial m-acting element of 
an LMW-GS gene (Colot et al. 1989). This c/s-acting ele- 
ment, called endosperm box, is generally well conserved in 
most seed storage protein gene promoters and is composed 
of the close juxtaposition of the GCN4 -like m oti£.(.GLM) 
and the endosperm motif (EM), afso~termed the prolamin- 
box (P-box) (Colot et al. 1989). Albani et al. (1997) showed 
that a leucine-zipper protein called storage protein activator 
(SPA) was able to bind to the GLM and thus corresponded 
to ESBF-II, whereas Mena et al. (1998) reported that an- 
other transcriptional protein called wPBF (wheat prolamin- 
box binding factor), which would probably correspond to 
ESBF-I (Conlan et al. 1999), was able to bind EM. Albani et 
al. (1997) obtained the SPA cDNA from Triticum aestivum 
'Chinese Spring', which is a 1647-bp-Iong sequence that 
contains an open reading frame coding for a protein of 409 
amino acids. This cDNA is also characterized by several 
ATG codons in its leader sequence that determine short up- 
stream open reading frames, a feature shared by genes en- 
coding transcription factors, such as Opaque-2 (Lohmer et 
al. 1993) and Blzl (Vicente-Carbajosa et al. 1998). They 



also demonstrated, by functional analysis, that SPA activates 
transcription through binding to the GLM of an LMW-GS 
gene of hexaploid wheat. This suggests that this gene is a 
target for SPA. No reports are available for any other targets 
of SPA or for its chromosomal location. 

The aim of this study was to identify genomic regions as- 
sociated with quantitative variations of storage protein frac- 
tions (QTLs for protein) and to map SPA using a population 
of RILs (F 7 ) derived from a cross between Triticum aestivum 
'Renan' and T. aestivum 'Recital'. 

Material and methods 

Plant materials 

Two French bread wheat cultivars, 'Renan', a superior 
bread-making wheat, and 'Recital*, a standard bread-making 
wheat, were used to develop a mapping population. 'Renan' 
is characterized by a level of proteins that is higher than that 
in 'Recital' when grown in the same environment. These two 
cultivars have the same alleles for Glu-Al-1, Glu-Bl-2, Glu- 
Dl-h and Glu-Dl-2 HMW (alleles 2*, 8, 5, and 10, respec- 
tively) and differ for the alleles of Giu-BJ-1 ('Renan* has al- 
lele 7 and 'Recital' has allele 6). A mapping population of 
194 F 7 recombinant inbred lines (RILs) was obtained by sin- 
gle-seed descent (SSD) from the cross between these two 
lines. The RILs and the parental lines were sown in field tri- 
als at INRA plant breeding station of Clermont-Ferrand, 
France (45°4T N, 3°10' E, 329 m elevation) with two repli- 
cations each in 1998 and 1999 as previously described 
(Groos et al 2002). Plots were harvested in 1999 and 2000, 
respectively. 

Aneuploid lines of 'Chinese Spring' used in this study 
consisted on 20 nullitetrasomic (NT) and 36 ditelosomic 
(Dt) lines kindly provided by Dr. S. Reader, John Innes Cen- 
ter, Norwich, UK. 

Genomic DNA was extracted from fresh leaves. The ex- 
traction method reported by Lu et al. (1994) was used for 
RILs and parental lines, whereas the method previously de- 
scribed by Murigneux et al. (1993) was used for aneuploid 
lines. 

Genetic map 

A genetic linkage map was obtained from this population 
and described in Groos et al. (2002). This map comprises 
436 loci consisting of RFLP, microsatellites, and AFLP 
markers spread over the genome for a total length of 
2740 cM, i.e., nearly 80% of the International Triticeae 
Mapping Initiative (ITMI) reference map total length (Leroy 
et al. 1997). This map also included some polymorphic stor- 
age protein loci, Glu-Bl-1 and Gli-Al. Linkage analyses 
were performed using Mapmaker/Exp version 3.06 (Lander 
et al. 1987). The Kosambi mapping function (Kosambi 
1944) was applied to transform recombination frequencies 
into additive distances in centimorgans (cM). Linkage 
groups were assigned to chromosomes by comparison to ref- 
erence maps using microsatellite loci (Cadalen et al. 1997). 

QTL analysis 

For each genotype, quantification of LMW glutenin sub- 
units, HMW glutenin subunits, cc+p^y gliadins, and co 
gliadins was achieved using capillary electrophoresis. Ex- 
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Fig. 1. Southern analysis. Chromosomal location of gene encoding SPA using Triticum aestivum 'Chinese Spring' (CS). 
(a) Nullitetrasomic lines (NT); T is followed by the name of the pair of duplicated chromosomes introduced in place of the lacking 
pair following N, e.g., N1AT1B corresponds to the nulli 1A tetra IB line, i.e., chromosome 1 A was substituted by chromosome IB. 
(6) Ditelosomic lines (Dt); the lacking chromosome arm followed Dt, e.g., the ditelosomic line DtlDS lacks the short arm of chromo- 
some 1 D. M Raoul, Raoul molecular marker. Disappearance of a band is marked by an arrow. 



(a) 
kb 



9 - 
7.4- 

5.6- 



ffl 


< 


< 


< : 




CQ 


< 




in 


CQ 


GQ 


Q 


GO 


CD 


Q 


< 


CD 


1^ 


T— 






<N j 




CO 


co 








in 


in 


in 


CO 


CD 


CO 








h- 




H 


H 




H 


H 








H 


h- 




H 


H 






a 


CQ 


o 


CO 


Q 


* 


CO 


Q 


1 


Q 




00 


Q 




m 


Q 




CQ 






T- 


CM 


CM 


CO 


CO 


CO 






LO 


in 


in 


(0 


CD 


CD 






z 


Z 


z 


Z 


Z 


Z 


z. 


Z' . : 


z 


Z 


Z 


Z 


z 


z 


z 


z 


z 


Z 1 







































H 

Q 

Z 



3 
O 
CO 

a: 




(b) 




traction and separation procedures were adapted from those 
described elsewhere (Carceller and Aussenac 1999). 
Gliadins were specifically extracted in 50% v/v ethanol. 
Glutenins were precipitated with propanol-2 (initial concen- 
tration 50%; final concentration 70%), and then resuspended 
in the presence of SDS and 2-mercaptoethanol. All extracts 
were assayed by using a Lowry method (BioRad, Hercules, 
Calif., ref. No. 5000116) modified to be usable in the pres- 
ence of reducing agent, i.e., after its neutralization by the ad- 
dition of iodoacetamide. Glutenins were separated on the 
basis of their molecular mass for 15 min at 15 kV and 20 °C 
in a polyacrylamide gel (BioRad, ref. No. 148-5032) diluted 



by 5% v/v methanol (capillary gel electrophoresis). An un- 
coated silica capillary of 24 cm total length (18 cm for the 
separation) and 50 |im internal diameter (BioRad, 
ref. No. 148-3060) was used. Gliadins were separated in free 
zone (capillary zone electrophoresis), in a phosphate buffer 
(BioRad, ref. No. 148-50 11), for 20 min at 22 kV and 45 °C. 
An uncoated silica capillary of 26 cm total length (20 cm for 
the separation) and 20 |im internal diameter (Thermo 
Finnigan, Ringoes, N.J. ref. TSP 20375) was used. Results 
were expressed as proportion of dry weight. 

QTL detection was based on the Splus (1993) "home- 
made" program (Groos et al. 2002). Briefly, this program 
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Fig. 2. Alignment of sequences corresponding to the 5' part of SPA. y09013 corresponds to 5' part of the sequence Y09013. 'Recital* 
and 'Renan' are the corresponding sequences from genomic DNA of Triticum aestivum 'RScitaF and T. aestivum 'Renan* cultivars, re- 
spectively. Each row has 60 nucleotides. The black box shows the start codon defined by Albani et al. (1997). Bold character indicates 
the SNP leading to a stop codon (TGA) instead of a TCA as in 'Renan*. The grey box shows an ATG codon that could constitute a 
potential start codon for 'Renan*. 

y0 9013 TTCGCCGGTTCCCACGGGGTTAAAAAGTTTAAGCCATACCATTCCTCGCCCAATTTTGTT 
Recital TTCGCCGGTTCCCACGGGGTTAAAAAGTTTAAGCCATACCATTCCTCGCCCAATTTTGTT 
Renan TTCGCCGGTTCCCACGGGGTTAAAAAGTTTAAGCCATACCATTCCTCGCCCAATTTTGTT 



y 0 9 0 1 3 CTAATC AATCACCTTTGCC CCAGTTCCATTTCGGAAAGCTTCC^^GAGCCCGTGTTCTT 

Recital CTAATCAATCACCTTTGCCCCAGTTCCATTTCGGAAAGCTTCCgggGAGCCCGTGTTCTT 

Renan CTAATCAATCACCTTTGCCCCAGTTCCATTTCGGAAAGCTTCCSBGAGCCCGTGTTCTT 

y0 9013 CTCACTGGAGGAGGCGATGCCCGAGCCCGACTCTAACCCCTGCCGGACCTCGTCGCCGCC 

Recital CTCACTGGAGGAGGCGATGCCCGAGCCCGACTCTAACCCCTGCCGGACCTCGTCGCCGCC 

Renan CTGACTGGAGGAGGCG^gCCCGAGCCCGACTCTAACCCCTGCCGGACCTCGTCGCCGCC 

y0 9013 GCTGGAGGCACACATGCTCGTCGCAGGACTCGGAGGAGTGGGCGCCGGCGAGGTCGTCGG 

Recital GCTGGAGGCACACATGCTCGTCGCAGGACTCGGAGGAGTGGGCGCCGGCGAGGTCGTCGG 

Renan GCTGGAGGCACACATGCTCGTCGCAGGACTCGGAGGAGTGGGCGCCGGCGAGGTCGTCGG 



y0 9013 CGGGTGCGCGACGAACGAGTGCGCGACAGAATGGTGCTTCCAGAAGTTCGTGGACGAGCC 

Recital CGGGTGCGCGACGAACGAGTGCGCGACAGAATGGTGCTTCCAGAAGTTCGTGGACGAGCC 

Renan CGGGTGCGCGACGAACGAGTGCGCGACAGAATGGTGCTTCCAGAAGTTCGTGGACGAGCC 

y 0 9 0 1 3 GTGGCTGCTC AACGTCCCC ACCGCGCC AGTGGCGAACCCCGAAGCTTCGACGCTTTACCC 

Recital GTGGCTGCTCAACGTCCCCACCGCGCCAGTGGCGAACCCCGAAGCTTCGACGCTTTACCC 

Renan GTGGCTGCTCAACGTCCCCACCGCGCCAGTGGCGAACCCCGAAGCTTCGACGCTTTACCC 

y0 9013 TAATCCCACGGCCGAGGGGAGCCGCAAGAGGCCGTACGACGTCCATGAGATGGTGGGCCC 

Recital TAATCCCACGGCCGAGGGGAGCCGCAAGAGGCCGTACGACGTCCATGAGATGGTGGGCCC 

Renan TAATCCCACGGCCGAGGGGAGCCGCAAGAGGCCGTACGACGTCCATGAGATGGTGGGCCC 

y0 9013 GGAGGAGGTCATCCCCACGCCGCCTGCGGCGAGCCCGGTGGTGGACCCCGTGGCGTACAA 

Recital GGAGGAGGTCATCCCCACGCCGCCTGCGGCGAGCCCGGTGGTGGACCCCGTGGCGTACAA 

Renan GGAGGAGGTCATCCCCACGCCGCCTGCGGCGAGCCCGGTGGTGGACCCCGTGGCGTACAA 



y0 9013 CGCGAT 
Recital CGCGAT 
Renan CGCGAT 
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Fig. 3. SNP polymorphism in a sample of F 7 recombinant inbreed lines (RJLs) obtained from the cross between 'Renan' (Rn) and 
'Recital' (Rc). This polymorphism was revealed by PCR using Pmutl and Pmut2 primers. M, molecular weight marker. The number of 
each RIL used is indicated. 



1018 pb 
506 pb 



1018 pb 
506 pb 




first performs a one-way analysis of variance (ANOVA) 
with every marker using a non-stringent threshold. A multi- 
ple regression is then carried out using the significant mark- 
ers previously detected to select a subset of non-related 
significant markers. These markers, except those located on 
a given linkage group, are further used as covariates when 
scanning this chromosome using the marker regression 
method for fitting either a one-QTL (Kearsey and Hyne 
1994) or two-QTL model (Hyne and Kearsey 1995). The 
95% confidence intervals of QTL locations and additive ef- 
fects were established by bootstrapping (Visscher et al. 
1996) using 1000 replicates. 

Sequence analysis 

The cDNA sequence of SPA (Albani et al. 1997), acces- 
sion number Y09013, was compared to sequences available 
in the Wheat Genoplante expressed sequence tag (EST) 
databank using BLASTN software (Altschul et al. 1997). 
This sequence was also used to design primers by Primer3 
software (Rozen and Staletsky 2000) for PCR amplification 
from genomic DNA. Partial sequences of SPA obtained from 
each parental line were assembled and verified with the 
Staden package (Staden et al. 2000). In the case of doubtful 
peaks, an undefined nucleotide (N) was introduced into the 
sequence. Sequences obtained were aligned for mutation de- 
tection. Suitable pairs of primers characterizing detected mu- 
tation were designed with Primer3 software using a melting 
temperature of about 60 °C. In this pair, the last 3' nucleo- 
tide of one primer corresponds to the mutation. 



PCR primers and PCR amplification 

Left and right primer sequences designed to amplify the 5' 
segment of SPA were PI (5'-GGCATCTCCGTCCTCTCTTC- 
30 and Pr (5'-TCCAACTTCCGTCTGAGCAT-3'), respec- 
tively. PCRs were performed for each of the parental lines of 
the mapping population in a 40-fiL final volume containing 
80 ng of genomic DNA, 10% v/v glycerol, 250 |iM of each 
dNTP, 0.4 H-M of each primer, 1 U Taq DNA polymerase 
(Roche, Basel, Switzerland) and 1* Taq polymerase buffer. 
Cycles were 94 °C for 4 min; 35 cycles of 94 °C for 1 min, 
60 °C for lmin 30 s, and 72 °C for 2 min ; and a final exten- 
sion at 72 °C for 10 min. Amplification products were re- 
solved on TAE 1 x 1 .2% w/v agarose gels. After running for 
30 min at 100 V, gels were stained with ethidium bromide 
for 20 min in a 1 mg/L solution and visualized on a UV ta- 
ble. Double-strand sequencing of PCR products was per- 
formed by Genome Express. PCRs based on primers 
designed for SNP mutation were performed for RILs using 
the same protocol. 

Southern blot analysis 

Aneuploid lines were used to assign SPA to specific chro- 
mosomes and chromosome arms (Sears 1954, 1966; Ander- 
son et al. 1992). Twelve micrograms of plant genomic DNA 
was digested for 5 h at 37 °C with HindlU, subjected to 
electrophoresis on a 0.8% w/v agarose gel, and transferred to 
a nylon membrane (Hybond-N+, Amersham, Piscataway, 
N.J.)- A cDNA clone annotated as SPA and extracted from 
the Wheat Genoplante cDNA library was amplified using the 
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Fig. 4. Genetic mapping of SPA using the SNP revealed by 
Pmutl and Pmut2 primers on chromosome IB (112 cM in 
length). On the right, anchor markers (bold) and SPA marker, 
followed by the distance to the neighbouring anchor marker. The 
QTL for the quantity of x subunit of Glu-BI (QTL qgtuBx ) is rep- 
resented on left hand side, with its bootstrap confidence interval 
(white box). 
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SP6/T7 primers and the purified PCR product was 
radiolabeled by a random primer labelling kit (Megaprime, 
Amersham Biosciences). The blots were hybridized over- 
night at 65 °C with the SPA cDNA probe. After hybridiza- 
tion, the blots were washed at 65 °C once with a 2% wlv 
SSC solution for 20 min, followed by two washes with a 0.5x 
SSC and O.lx SDS solution for 20 min. The blots were ex- 
posed to X-ray film to produce autoradiograms. 

Results 

BLAST analysis and RFLPs 

BLAST analysis revealed that the Wheat Genoplante EST 
data contains only one contig, G468.1 14N24F01 082 1.2.1, 
which is 1645 bp long and corresponds to the sequence 
Y09013 (1647 bp). The alignment between these two se- 
quences showed a score of 2940 (E value = 0). The length of 
the alignment between these two sequences was 1645 bases, 
showing that there was 97% identity between them. The next 
alignment found by BLASTN showed only a score of 56 (E 
value = 1 x 10" 6 ) for a clone 1126 bp in length. 



The contig G468.1 14N24F0 1082 1.2.1 contains seven 
ESTs, one of them (G468.1 14N24F0 10821) belonging to 
Genoplante. This EST, 585 bp long, belongs to a clone ex- 
tracted from a cDNA library isolated from developing grains 
of 'Recital' and corresponds to the 3' part of the sequence 
Y09013. Some differences existed between the sequence of 
the clone belonging to Genoplante and sequence Y09013. 
This may indicate that they do not characterize the same 
homoeologous gene. The cDNA probe used resolved a dis- 
tinct RFLP locus on chromosome ID. Furthermore, fainter 
bands allowed detection of not so distinct RFLP loci on 
chromosomes 1A and IB. These results suggest that at least 
one SPA locus is present per diploid genome (Fig. \a). This 
assignment was confirmed by the results obtained with Dt 
lines, which also showed that SPA is located on the long arm 
of chromosomes 1A, IB, and ID (Fig. \b). 

Detection of a SNP and mapping of SPA 

The primers PI and Pr were positioned on the sense strand 
of the sequence Y09013 at 57 bp and 571 bp, respectively. 
These primers gave PCR products of about 500 bp from 
genomic DNA of 'Renan' and 'Recital'. This size fit that ex- 
pected from the cDNA Y09013, indicating the absence of 
any intron in this part of the gene. Figure 2 shows the align- 
ment between the sequence Y09013 and those of 'Renan' 
and 'Recital', confirming the absence of an intron in this 
part of the gene. Only one obvious difference appeared be- 
tween the two cultivars at position 123, 17 bases after the 
potential start codon (Albani et al. 1997). The TCA codon 
present in 'Recital' and 'Chinese Spring' was substituted by 
TGA in 'Renan'. This mutation introduced a stop codon in 
the 'Renan' sequence, possibly leading to a truncated pro- 
tein. Specific primers for each allele were designed. The for- 
ward and reverse primers used to map this mutation were 
Pmutl (5'-ATGGAGCCCGTGTTCTTCTG-3'), correspond- 
ing to the 'Renan' allele, and Pmut2 (5'-CACCATCTCATG- 
GACGTCGTA-3'), respectively, producing an amplicon of 
3 1 1 bp (Fig. 3). This pair of primers revealed the expected 
band of 31 1 bp characterizing the 'Renan' allele in 84 RILs, 
whereas 109 RILs give no band, thus indicating that they 
had the 'Recital' allele. Thanks to this SNP, we were able to 
map SPA on the long arm of chromosome IB, 1.3 cM away 
from the Glu-Bl-1 structural gene (Fig. 4). This result is in 
agreement with that obtained by Southern assignment. 

QTL detection 

QTLs for storage protein fractions were detected and 
mapped on six chromosome regions (Table 1). Three QTLs 
had small effects and large confidence intervals (on chromo- 
somes 1A, 2B, and 5A), but the QTL on chromosome 7A 
appeared to have a relatively large effect, although its confi- 
dence interval was also quite large. One of the QTLs located 
on chromosome 1A colocated with its structural gene, Gli- 
Al. The QTL located on the long arm of chromosome IB 
had a strong effect (r 2 = 70%) on the quantity of the x sub- 
unit of Glu-Bl (Glu-Blx) and smaller effects on the quantity 
of y subunits of Glu-DI (Glu-Dly) and total HMW-GS, with 
the allele from 'Renan' giving higher Glu-Blx and total 
HMW-GS content, whereas it had a small, opposite effect on 
the quantity of Glu-Dly. This QTL explained 70% of 
phenotypic variation for Glu-Blx, and was located with 
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Table 1. Summary of QTLs identified in the 'Renan* x 'Recital' RIL population for quantitative variation of storage protein fractions 
measured by capillary electrophoresis. 



Storage protein quantity 


Chromosome 


Field trial 3 


h2 b 


Detection power* 


Map location (CI./ 


Allele 6 


co gliadins 


1A 


CFOO 


8.1 


0.905 


36.7 (6.9-69) 


Recital 


HMW-GS 


IB 


CFOO 


8.2 


0.78 


79.3 (16.2-85.2) 


Renan 


Glu-Blx 


IB 


CF99-CF00 


70.3 


1 


80.8 (77.7-82.3) 


Renan 


Glu-Dlx 


IB 


CF99 


10.5 


0.895 


86.6 (66.5-96.7) 


Renan 


Glu-Dly 


IB 


CF99-CF00 


42.2 


1 


78.1 (72.8-82.1) 


Recital 


a + p + 7 gliadins 


2B 


CF99 


11.6 


0.935 


81.4 (51.7-115.2) 


Recital 


LMW-GS 


2B 


CF99-CF00 


11.4 


0.92 


89.6 (80.1-125.8) 


Recital 


HMW-GS 


5A 


CF99 


15.5 


0.975 


54.1 (18.7-87.9) 


Renan 


Glu-Dlx 


5A 


CF99 


8.6 


0.83 


55 (7.1-143.1) 


Renan 


LMW-GS 


7A 


CF99-CF00 


36.6 


0.885 


142 (82.6-150.2) 


Renan 


LMW-GS 


7D 


CF99 


8.4 


0.845 


62.1 (50.4-74.5) 


Renan 



"CF99 and CFOO correspond to field trials harvested at Clermont-Ferrand in 1999 and 2000, respectively. 
^Heritability, i.e., the percentage of phenotypic variance explained by the QTL. 

Corresponds to the proportion of the bootstrap replicates in which the QTL was found to be significant (out of 1000 replicates). 
Confidence intervals of QTL locations are the 2.5 and 97.5 percentile of the empiric bootstrap distribution. 
'Indicates which parental allele is associated with the highest quantity. 



fairly good precision. Two markers are located within its 8- 
cM confidence interval and can thus be postulated as candi- 
date genes: the Glu-Bl-1 gene itself and SPA, a transcription 
factor previously identified as an activator of Glu-Dl which 
codes for an LMW-GS. 



Discussion 

Five QTLs have been found with a high confidence level 
(percentage of bootstrap replicates giving significant results) 
over two years (1999 and 2000) for quantitative variation of 
several wheat storage proteins. Two QTLs colocate with 
their structural genes (Gli-Al and Glu-Bl), whereas the oth- 
ers do not. In this study, we focused on the QTL on chromo- 
some IB in particular, which shows both the largest effect 
and a colocation with two possible candidate genes: the 
structural gene Glu-BJ-1 itself and SPA, the gene encoding a 
putative ly interacting transcription factor. 

Before now, the chromosomal location of wheat SPA was 
unknown. Southern blot analysis revealed that one SPA locus 
exists on the long aim of each homoeologous group 1 chro- 
mosome. However, this technique does not permit determi- 
nation of whether these loci contain a single or several 
tandem copies of SPA. An SNP marker was developed, 
which made it possible to locate this gene on chromosome 
IB. This is in agreement with results obtained by mapping 
with RFLPs. 

The SNP used to map SPA corresponds to the allele from 
'Renan*. It introduced a stop codon into the coding sequence 
defined by Albani et al. (1997). This stop is fairly close to 
the start codon of the published sequence (it is the 6th codon 
after the start codon). It is followed by four more codons 
and another ATG codon, which could constitute a new po- 
tential start codon. This organisation creates a new short up- 
stream open reading frame (uORF), in addition to those 
described by Albani et al. (1997). The presence of a few 
short uORFs is frequent in the mRNA of transcriptional fac- 
tors where they constitute translational control elements. For 
example, translation of mRNA of the maize transcriptional 
activator Opaque-2 is inhibited by ORFs (Lohmer et al. 



1993). Moreover, such an organisation may also lead to a 
shorter protein with a non-altered bZip domain than that pre- 
dicted by Albani et al. (1997). Protein size polymorphism is 
also reported for maize bZip activator Opaque-2 (Hartings et 
al. 1995). Surprisingly, this truncated protein corresponds to 
'Renan,' which has the greatest quantity of Glu-Blx. 

The QTL for Glu-Blx had two candidate genes: SPA and 
Glu-BI-1, both located within the confidence interval of the 
QTL. This result indicates that both candidates may be in- 
volved in the synthesis of this HMW-GS. 

SPA function has been elucidated (Albani et al. 1997; 
Conlan et al. 1999) and it was demonstrated that this bZip 
protein can activate transcription of an LMW-GS gene via 
the GLM. SPA may be involved in the activation of many 
other wheat prolamin genes, which generally contain con- 
served GLM in their promoter, since the promoters of 
HMW-GS genes are characterized by a motif called "cereal a 
box", which contains a GLM. -Results reported by Norre et^ 1- * 
al. (2002) supported a~ possible interaction between Opaque-^ 
2, a bZIP factor of maize, and DNA-binding sites of the 
HMW-GS promoter, for Glu-Dl -1. Moreover, group 1L 
chromosome arms often appear to have positive effects on 
HMW-GS genes (Wanous et al. 2003). Therefore, the loca- 
tion of SPA in the confidence interval of a QTL for the x 
unit of Glu-Bl-l might suggest that SPA would also be able 
to activate expression of HMW-GS genes. SPA might have 
several targets. Such a phenomenon has already been dem- 
onstrated for Opaque-2. This transcriptional factor was 
shown to differentially influence transcription rates of ot-zein 
genes, which shared common c/s-acting elements, including 
the binding site for Opaque-2 (Kodrzycki et al. 1989). It is 
also required for the expression of a number of non storage 
protein genes including b-70 and b-32 (Di Fonzo et al. 1979; 
Lohmer et al. 1991). 

As expected, the structural gene Glu-Bl ~1 is also a candi- 
date for the QTL on chromosome IB. This result may be ex- 
plained by polymorphism between the promoters of alleles 6 
and 7 of Gtu-Bl-1. Considerable polymorphism was de- 
tected between the promoters of several HMW-GS genes by 
Anderson et.al. (1998). These promoters can differ by the 
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cereal box copy number. More recently, a variability has 
been reported between two versions of the promoter of allele 
7 of Glu-Bl-1: one version presents a duplication of a 43 -bp 
element, which is systematically associated with an over- 
expression of Bx7 (Juhasz et al. 2003). However, the poly- 
morphism of the promoters for Glu-Bl-I is not yet fully de- 
scribed for every allele. 

Therefore, our results show that it is difficult to separate 
the respective effects of the two candidates and that farther 
analysis is necessary. It would be interesting to create and 
analyse new RILs with parents segregating for only one can- 
didate gene. Ultimately, over expression and knock out of 
SPA could be of interest in studying the influence of this 
gene on storage protein synthesis. Moreover, it should be re- 
membered that other transcriptional factors (Carbonero et al. 
2000; Diaz et al. 2002) are involved in the complex regula- 
tory pathway of cereal storage proteins, the dissection of 
which has just begun. 
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Abstract 

Cassava vein mosaic virus (CsVMV) is a pararetrovirus that infects cassava plants in Brazil. A promoter fragment 
isolated from CsVMV, comprising nucleotides -443 to +72, was previously shown to direct strong constitutive 
gene expression in transgenic plants. Here we report the functional architecture of the CsVMV promoter fragment. 
A series of promoter deletion mutants were fused to the coding sequence of uidA reporter gene and the chimeric 
genes were introduced into transgenic tobacco plants. Promoter activity was monitored by histochemical and 
quantitative assays of /i -glucuronidase activity (GUS). We found that the promoter fragment is made up of different 
regions that confer distinct tissue-specific expression of the gene. The region encompassing nucleotides -222 to 
- 1 73 contains cis elements that control promoter expression in green tissues and root tips. Our results indicate 
that a consensus asl element and a GATA motif located within this region are essential for promoter expression in 
those tissues. Expression from the CsVMV promoter in vascular elements is directed by the region encompassing 
nucleotides -178 to -63. Elements located between nucleotides -149 and -63 are also required to activate 
promoter expression in green tissues suggesting a combinatorial mode of regulation. Within the latter region, a 
43 bp fragment extending from nucleotide - 14 1 to -99 was shown to interact with a protein factor extracted from 
nuclei of tobacco seedlings. This fragment showed no sequence homology with other pararetrovirus promoters and 
hence may contain CsVMV-specific regulatory cis elements. 



Introduction 

Caulimoviruses (type member: cauliflower mosaic 
virus, CaMV) and badnaviruses (type member: com- 
melina yellow mottle virus, ComYMV) are grouped 
together as plant pararetroviruses [34]. These viruses 
possess a circular double-stranded DNA genome of 
around 8 kb that is replicated through a process that 
includes a step of reverse transcription [30]. Repli- 
cation is initiated by the synthesis of a terminally 
redundant genome- length RNA under the control of 
a viral promoter, also called the genomic promoter 
[34]. Pararetrovirus genomic promoters interact with 
the plant RNA polymerase II [27] and their regulation 



is apparently not dependent upon any virus-encoded 
transcriptional factors [26]. Consequently, these pro- 
moters were isolated from pararetrovirus genomes to 
express foreign genes in transgenic plants [6, 25]. The 
35S promoter of CaMV has been widely used in plant 
biotechnology and its functional organization is well 
described. This promoter is highly active in plant cells 
and directs constitutive gene expression in most tis- 
sues of transgenic plants [15, 28, 40]. The regulatory 
region of the 35S promoter is made up of distinct 
domains that are each involved in organ-specific ex- 
pression [1,3]. It has been suggested that the strength 
and the constitutive properties of the 35 S promoter de- 



1056 



rive from additive and synergistic interactions between 
these different domains [2, 10]. Within these promoter 
domains two c/s-acting elements were characterized. 
The activating sequence 1, or as 1, is a TGACG repeat 
motif extending from —82 to —64 that is the binding 
site for the tobacco transcription factor TGAIa [16, 
19]. It was shown that this cis element has an impor- 
tant role in the control of gene expression in roots [11, 
1 9] and functions in synergy with upstream domains 
to potentiate promoter activities in other tissues [1, 
3, 10, 1 1]. Also, a GATA motif located at from -85 
to -105, called activating sequence 2 (as2)7 interacts 
with nuclear proteins from tobacco leaf tissues and has 
biological importance for promoter activity in those 
tissues [20]. A promoter similar to the 35S CaMV pro- 
moter was isolated from figwort mosaic caulimovirus, 
FMV [35]. This promoter, referred to as the 34S 
promoter, also directs constitutive gene expression in 
transgenic plants and is of comparable strength to the 
CaMV 35S promoter. Sequence comparison between 
the two promoters in the region extending from — 100 
to +1 reveals 65% nucleotide homology [35]. Based 
on the similarity between these two promoters, it has 
been suggested that similar mechanisms of regulation 
may govern their expression. 

The cassava vein mosaic virus (CsVMV) is a 
double-stranded DNA virus that infects cassava in 
Brazil [17, 21] and has previously been listed as 
a putative member of the genus Caulimovirus [14]. 
However, the recent molecular characterization of the 
CsVMV genome [4, 7] revealed that this virus has 
a specific genome organization not found in other 
caulimoviruses or badnaviruses that have been char- 
acterized. Also it was proposed that CsVMV might 
be representative of a new genus of double-stranded 
DNA plant viruses [7]. Recently, we isolated a pro- 
moter fragment from the CsVMV genome [37] that 
is capable of driving high levels of heterologous gene 
expression in different plants and in protoplasts. This 
promoter directs expression of a reporter gene in all 
organs and in different cell types of transgenic plants 
suggesting a constitutive activity similar to that of the 
CaMV 35S promoter. However, the CsVMV has lim- 
ited sequence homology with the two caulimovirus 
promoters previously described. These facts suggest 
the presence of different c/s-elements in the promoter 
and therefore the possibility of different regulatory 
mechanisms compared with the CaMV 35 S promoter. 
To address this question, we analyzed the functional 
organization of the CsVMV promoter using a deletion 
analysis of the region upstream of the TATA box. The 



effects of various deletions on promoter activity were 
monitored in transgenic tobacco plants using the uidA 
reporter gene. Additional data were also provided by 
transient expression experiments in protoplasts and by 
in vitro DNA-binding assays. We show that the consti- 
tutive pattern of expression of the CsVMV promoter 
is due to the presence of distinct organ specific ele- 
ments. The relative importance of the different regions 
of the CsVMV promoter as well as its mechanisms of 
regulation are discussed. 



Materials and methods 

Promoter constructs and plasmids 

The starting plasmid for this study was pILTAB:CVP2 
which contains a CsVMV promoter fragment extend- 
ing from +72 to —443 [37]. Due to the absence of 
convenient restriction sites in the CsVMV promoter 
fragment, polymerase chain reaction (PCR) was used 
to generate a set of 5'-terminal and internal deletions. 

Deletions from the 5' end of the promoter were 
obtained by PCR amplification. A common reverse 
primer PI' (Table 1) which hybridizes at the 3' end 
of the promoter was paired with CsVMV-specific 
primers P2, P3, P4, P5 and P6 (Table 1) to gener- 
ate five promoter fragments designated B, C, D, E 
and F. These fragments have a common 3' terminus 
at position +72 and their 5'-end points at positions 
-330, -222, -178, -112 and -63, respectively 
(Figure 1). A full-length promoter fragment (A frag- 
ment) was also re-synthesized using the primer PI and 
PI' (Table 1). PCR reactions were carried out with 
100 ng of pILTAB:CVP2, 2.5 U of Tag DNA poly- 
merase (Gibco-BRL, Gaithersburg, MD) and standard 
concentrations of primers, MgCh and dNTP. Twenty 
cycles (94 °C, 30 s; 56 °C, 30 s, 72 °C, 30 s) of ampli- 
fication were performed and were followed by 5 min 
of final elongation at 72 °C. Each of the five amplified 
DNA fragments was digested by BamHl and £c<?RI 
and ligated into the same sites of a plasmid containing 
the coding sequence of the uidA gene (coding for p- 
glucuronidase) linked to the 3' polyadenylation signal 
of the nopaline synthase gene. The resulting plasmids 
were named pA, pB, pC, pD, pE, pF according to the 
promoter deletion they carry (Figure 1). 

The internal promoter deletions were constructed 
through a two-step process. First, PCR reactions were 
performed as described above to generate a set of 3'- 
end deletions of the CsVMV promoter. The PI sense 
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Table J. Oligonucleotides used to generate CsVMV promoter fragments by PCR amplification. 
Primers contain CsVMV promoter sequences in sense (+) or reverse orientation (-). Coordi- 
nates of the primers relative to the transcription start site are noted. PI ' in association with P2 to 
P6 were used to create 5' terminal deletions of the CsVMV promoter as described in Materials 
and Methods. Similarly, PI in association with P2' to P7' were used for 3' end deletions. PI 
and PI' contain, respectively, aXbal site and a EcoRl site at their 5' ends while other primers 
encode BamHl site. The restriction sites are indicated by bold letters. P7 and P8 as well as 
the reverse primers P8' and P9' were used to generate probes and competitor fragments for gel 
retardation assays (see Materials and methods for details). 



Name 


Primer sequence (5' to 3') 


Position 




Orientation 


PI 


G CTCTAG ACC AG AAGGTAATTATCC AAG 


-443/ - 


-423 


+ 


P2 


TATG G ATCCTATGTTC AAAA ATG AAG 


-330/ - 


-312 


+ 


r J 




-222/ - 


-206 


I 


P4 


AGAGGATCCGGTCGGTGATTGTGAA 


-178 - 


/163 


+ 


P5 


A AAG G ATCCTTATC AC A A AGG AATC 


-112/- 


-95 


+ 


P6 


TATGGATCCGTGTCATTTTTGCCCTTG 


-63/- 


43 


+ 


P7 


GAGACATAGAGAGGACACATGT 


-161/- 


- 142 


+ 


P8 


AAGGTGGAAAATGTAAGGGCG 


-141/- 


- 121 




PI' 


CG G A ATTC AAACTTAC AA ATTTCTCTAAG 


+72/ + 50 




P2' 


TAAGGATCCTTTCCGCCCTTACATT 


-116/- 


- 132 




P3' 


CATGGATCCTCTATGTCTCTTTCAC 


-149/ - 


- 168 




P4' 


ACAGGATCCGACCTTATCTTCT 


— 173/ - 


- 187 




P5' 


ACCGGATCCTCTTCTTTTCATTGTTC 


-182/ - 


- 199 




P6' 


TCAGGATCCTTTTCTTCGCCTGGT 


-228/ - 


-243 




py 


ATAGGATCCATATGTGCCGCATA 


-334/ - 


-348 




P8' 


CCTTTGTGATAAGGTTACTTTCC 


-99/- 


121 




W 


GACACGGAAAAATATAAAAGG 


-56/- 


76 





Xbal, BamHl 



EcoW 




Mesophyll Vascular Root tips 



-334Z-222 pAB-^ESSS 

-228M78 PAC 

-182/112 PAD1 

-173/- 112 PAD2 

-149/- 112 PAD3 

-182/-63 PADE1 

-173/-63 PADE2 

-149/-63 PADE3 

-U6/-63 PAE 



+ 

+ 
+ 

+ 

(+/-) 
+ 

(+/-) 

+ 
+ 

(-) 
(+/-) 
(+/-) 



+ 



Figure I. Schematic representation of the deleted CsVMV promoter: uidA fusion genes and a summary of histochemical detection of GUS 
activity in transgenic tobacco plants. The names of the different plasmids and the end points of the 5' and internal deletions are indicated. 
Internal deletions are designated by symbol A. The construct pA contains a full-length CsVMV promoter. All promoters deleted from the 5' 
end have a BamHl site at their 5' ends. Internal deletions were created by BamHl ligation of the promoters truncated from the 5' end with those 
truncated from the 3' end (see Materials and methods). The data summarize the observations performed on ten different transgenic Rl tobacco 
plants for each promoter construct. The transgenic plants were analyzed for GUS expression in mesophyll cells, vascular tissues and root tips. 
GUS expression is scaled in each tissue according to 4 levels. +: no visible difference with the full length promoter; (+/-): lower staining 
than with the full length promoter; (-): very low expression; — : no detectable staining. 
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Figure 2. Histochemical localization of GUS expression in transgenic tobacco plants containing deleted CsVMV promoters: w/'aW fusion gene 
constructs. Pictures presented, with the exception of h and i f are cross-sections of young expanded leaves from 5-week old transgenic Rl tobacco 
plants, a. Construct pB. This expression profile in leaf was also observed with plants containing the construct pC. As with the full-length CsVMV 
promoter, strong GUS expression was detected in photosynthetic tissues (palisade and spongy mesophyll) as well as in the phloem cells and 
in the xylem parenchyma of the midrib, b. Construct pD. Strong staining was localized in the lateral secondary veins, in the phloem tissue and 
in the xylem parenchyma of the midrib while no or weak expression was detected in the mesophyll tissues, c. Construct pE. Secondary lateral 
veins as well as the vascular elements of the midrib exhibited a weak staining, d. Construct pAC. Strong staining was detected in the vascular 
tissues of the midrib and in the lateral veins. We noticed a decrease in GUS expression in photosynthetic tissues as shown on the picture by a 
moderate staining in spongy mesophyll cells, e. Construct pADl. The same GUS expression pattern was observed with the constructs pADE2 
and pADE3. Moderate staining is visualized in the lateral veins and in the phloem cells of the midrib, f. Construct pADEl. No expression 
was detectable with the exceptions of few phloem cells that exhibited a very weak staining, g. Construct pAD2. Plants carrying the deletion 
constructs pAD3 and pAE displayed similar GUS expression pattern. GUS expression was detected in the mesophyll and vascular tissues and 
was in the same range of intensity as what we observed with the undeleted promoter, h. Leaves from 10-day old transgenic seedlings carrying 
pB construct (left) or pD construct (right), i. Roots from transgenic tobacco plants containing the pC promoter construct (top) or pD promoter 
construct (bottom), m, mesophyll; py, parenchyma, v, vascular tissue. 



primer (Table 1 ) which hybridizes at the 5' end of the 
promoter was paired with each of 6 specific CsVMV 
reverse primers, P2' to P7' (Table 1), to generate six 
truncated promoters with a common 5' end at posi- 
tion —443 and 3' end points spanning from position 

— 116 to —334. Internal deletions were engineered 
by cloning the different 3' end truncated promoter 
fragments upstream of the 5' end deleted promoter 
constructs (Figure 1). Accordingly, a 3' deleted pro- 
moter fragment encompassing nucleotides —443 to 
—334 was digested by BamlW and Xbal and ligated 
to the same sites in the pC plasmid. The resulting 
plasmid, named pAB (Figure 1) contains an internal 
deletion from nucleotides —334 to —222 (Figure 1). 
Similarly, a fragment spanning nucleotides —443 to 
—228 was fused to the D promoter fragment to create 
the plasmid pAC (Figure 1). Three fragments with 
a common 5' end at —443, and 3' ends located at 

— 182,-173, and — 149 were cloned individually into 
the plasmid pE to create the plasmid pADl, pAD2 
and pAD3, respectively (Figure 1). The same three 
fragments were cloned into pF to create the plasmid 
pADEl, pADE2 and pADE3 (Figure 1). A fragment 
containing nucleotides —443 to — 1 1 6 was cloned with 
the same method into the plasmid pF to generate the 
plasmid pAE (Figure 1). All promoter sequences were 
verified by dideoxynucleotide sequencing. The differ- 
ent CsVMV promoter- uidA fusion genes were excised 
by Xbal and Hindlll and ligated to the same sites of 
the pBin 19 binary vector (Clontech, Palo Alto, CA). 

Transgenic tobacco plants 

The pBin 19 derivatives carrying the deleted pro- 
moter constructs were transferred by electropora- 
tion into Agrobacterium tumefaciens strain LBA4404. 
A grobacterium -mediated transformations of Nico- 
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Promoter constructs 

Figure 3. GUS enzyme activity in transgenic tobacco leaves car- 
rying CsVMV promoter constructs. Proteins were extracted from 
leaf disks collected from young expanded leaves of 5 week old 
transgenic tobacco plants. For each promoter construct, 6 to 10 
independent transgenic tobacco lines were assayed for GUS activity. 
Data are given as pmol of 4-methylumbelliferone per mg of protein 
per minute. The results obtained for each plant is represented by an 
open dot. The different promoter constructs analyzed in these assays 
are indicated. Average level of expression is indicated by a vertical 
arrowhead. A logarithmic scale is used to accommodate the large 
variation between transgenic lines. 



liana tabacum cv. Xanthi NN were performed as 
previously described [13]. Regenerated kanamycin re- 
sistant plants were grown to maturity in a greenhouse 
and allowed to self-fertilize. Rl seeds were germi- 
nated on Murashige and Skoog (MS) culture medium 
[24] with 100 mg/1 kanamycin and transferred to soil 
in a greenhouse. Between 10 and 20 independent 
transgenic lines were produced for each construct. Ten 
independent Rl lines for each promoter construct were 
analyzed for this study. 
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Figure 4. Transient GUS expression of CsVMV \uidA gene con- 
structs in BY-2 protoplasts. Electroporated protoplasts prepared 
from BY-2 cell suspensions were analyzed for GUS activity after 
24 h of culture. The deleted CsVMV promoter constructs were 
co- trans fected with a luciferase plasm id used as an internal standard. 
GUS expressions levels were normalized in relation to the activity of 
luciferase. Normalized values of GUS expression are expressed as 
a percentage of the full-length promoter activity. A 100% value was 
assigned to the construct pA which contains an undeleted CsVMV 
promoter. The different promoter constructs are indicated on the 
figure. Bars represent the average of four independent experiments 
+/- standard errors. 

GUS expression analysis 

For the histochemical detection of /3 -glucuronidase 
(GUS) activities, fresh tissue sections were taken 
from young expanded leaves and incubated for 6 
to 12 h at 37 °C in reaction buffer [15] contain- 
ing 1 mM 5-bromo-4-chIoro-3-indoiylglucuronide(X- 
gluc), 100 mM sodium phosphate buffer pH 7, 2 mM 
potassium ferrocyanide and potassium ferricyanide, 
and 0.1% Triton X-100. For the GUS analysis of 
young Rl seedlings, whole plantlets were collected 
around one week after germination and immersed 
in GUS buffer [15]. After vacuum infiltration, in- 
cubation was carried out overnight at 37 °C. Sam- 
ples were cleared by several washes in 70% ethanol. 
Quantitative GUS analyses using the substrate 4 
methylumebelliferone-0-S glucuronide (MUG) were 
performed as described by Jefferson et al [15]. 

Protoplast isolation, treatment and culture 

Protoplasts from BY-2 tobacco suspension cells were 
prepared essentially as described by Watanabe et al. 
[39]. A 30 ^g portion of plasmid DNA with 30 \i% 
of a 35S-luciferase construct [28], which served as 
internal standard, was introduced into protoplasts as 
previously described [37]. Protoplast were collected 
for protein extraction after 24 h of culture at 25 °C. 
The MUG and LUC assays were performed on the 



protoplast protein extracts as described [37]. Results 
were expressed as a ratio between the GUS activity of 
the CsVMV promoter construct and the LUC activity 
of the internal control. 

Probes and competitors 

DNA fragments used as probes in binding assays were 
generated by PCR using 200 ng of pILTAB:CVP2 
plasmid containing the full-length CsVMV promoter 
[37], Pfu DNA polymerase (Stratagene, La Jolia, CA) 
and its appropriate buffer, specific primers, and dNTPs 
as standard concentrations. A DNA fragment encom- 
passing — 161 to —56 was obtained using the primers 
P7 and P9' (Table 1) after 20 cycles of amplification 
performed with the following temperature profile: 30 s 
at 94 °C; 30 s at 58 °C; 45 s at 72 °C The PCR 
mixture was extracted with phenol/chloroform and the 
DNA was ethanol-precipitated. Amplified DNA was 
subjected to electrophoresis through on a 3% Biogel 
(Bio 101, La Jolla, CA) and the probe fragment was 
eluted using the Mermaid kit (Bio 101). Purified DNA 
was end- labeled with [y 32 P] ATP using T4 polynu- 
cleotide kinase (NEB, Beverly, MA). The probe was 
purified from unincorporated nucleotides by two suc- 
cessive rounds of ethanol precipitation in the presence 
of ammonium acetate. 

Specific DNA competitors were produced by PCR 
using Pfu polymerase and the conditions described 
above. In addition to the —161/— 56 fragment, three 
subsets encompassing — 161 to —99 (obtained with P7 
and P8' primers, Table 1), -141 to —56 (obtained 
with P8 and P9' primers) and -141 to -99 (obtained 
with P8 and P8' primers) were used as competitors. 
DNA fragments were gel purified using the Mermaid 
kit (Bio 101). A 557 bp DNA fragment isolated by 
EcoRI and £coRV digestion of the coding region of 
the uidA gene was used as a non-specific competi- 
tor DNA. This fragment contains uidA nucleotides 
from the ATG position to the first £coRV site in the 
sequence. 

Preparation of tobacco nuclear extracts 

A protein nuclear extract was prepared from 2- week 
old tobacco (N. tabacum cv. Xanthi NN) seedlings 
grown in growth chambers (16 h day light cycle). All 
purification steps were carried out at 4 °C. Nuclei were 
isolated following the protocol described by Giuliano 
et al. [12] except that benzamidine was omitted from 
the extraction buffer. The isolated nuclei were then re- 
suspended (50 /xl/g of starting tissue) in lysis buffer 
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containing 20 mM HEPES pH 8, 0.4 mM EDTA, 
0.5 mM DTT, 420 mM NaCl, 25% (v/v) glycerol, 
0.1 mM PMSF, 2 /ig/1 leupeptine, 3 fig/\ pepstatine 
and 40 fig/l antipain. The suspension of nuclei was 
homogenized by pipeting up and down and incubated 
on ice for 1 h (with occasional mixing). The resulting 
extract was centrifuged at 100 000 x g for 1 h. The 
supernatant was collected and dialyzed for 2 h against 
2 liters of dialysis buffer containing 20 mM HEPES 
pH 8, 50 mM KC1, 0.4 mM EDTA, 20% glycerol, 
0.1 mM PMSF and 0.5 mM DTT. The dialysate was 
centrifuged at 12 000 x g for 10 min to remove insol- 
uble material and the supernatant was aliquoted and 
stored at —80 °C. Protein concentrations of nuclear 
extracts were determined using the Bradford assay 
(BioRad, Hercules, CA). 

Gel shift assays 

Protein-DNA binding reactions were carried out on ice 
in 25 /xl of 1 x binding buffer (20 mM HEPES pH 7.8, 
50 mM KC1, 1 mM MgCl 2 , 0.05% Nonidet 40, 1 mM 
DTT and 10% glycerol) containing 2 fig of poly-dl- 
dC as non-specific competitor and an adequate amount 
of nuclear extract protein (amounts of nuclear extract 
are noted in the legend to Figure 5). The mixture was 
pre-incubated for 10 min on ice before addition of 
10000 cpm of the labeled probe. The reaction was 
allowed to proceed on ice for an additional 25 min. 

For competition assays, competitor DNAs were 
included in the reaction before addition of the la- 
beled probe. The amount of competitor used for each 
assay is indicated in Figure 5. Reaction mixtures 
were subjected to electrophoresis through a 5% non- 
denaturing polyacrylamide gel in lx TGE buffer, 
pH 8.5, at 130 V for 3 h at 4 °C. Gels were fixed in 
10% methanol, 10% acetic acid, dried and exposed 
overnight on X-ray films with an intensifying screen. 



Results 

Expression patterns of deleted CsVMV promoter 
constructs in transgenic tobacco plants 

The CsVMV promoter was mutated by progressive 
5' deletions and by internal deletions as described in 
Materials and methods. Expression patterns of the dif- 
ferent promoter constructs were analyzed in transgenic 
Rl tobacco plants using histochemical staining of 
GUS activity. The presence of an intact promoter: uidA 
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Figure 5. Interaction of a fragment extending from nucleotide — 1 6 1 
to —56 with nuclear extracts from tobacco seedlings. The fragment 
fl, spanning nucleotides —161 to —56, was end-labeled and incu- 
bated in the presence or absence (— ) of nuclear extract (N.ex) as 
indicated in Materials and methods. The free probe (FP) is visible 
at the bottom of the figure while a DN A/protein complex (RC) of 
slower electrophoretic mobility is marked by an arrow. In lanes 
4 to 10, 10 to 100 x molar excess of unlabeled competitor DNA 
were added to the binding reaction mixtures. In addition to the fl 
fragment, f2, £3 and f4 fragments were used as competitors. In lane 
10, lOOx molar excess of a 557 bp fragment isolated from the uidA 
coding sequence was used as a non-specific competitor. 



gene cassette was confirmed by PCR and/or South- 
ern analysis for each transgenic plant line used in this 
study (data not shown). For each promoter construct, 
we obtained between 10 to 20 transgenic plant lines. 
GUS expression patterns observed between plants 
containing the same promoter construct were similar 
while we observed variations in the level of trans- 
gene expression among individual transformants. The 
plants tested in this study contained between 1 and 5 
copies of the uidA fusion gene (data not shown). The 
copy number did not affect the characteristic pattern 
of expression observed with each construct and there 
was no clear correlation between the copy number 
and the apparent intensity of the staining. For each 
promoter construct, we always obtained a small num- 
ber of transgenic plants for which the expression of 
the transgene was not detected despite its presence as 
shown by Southern blotting, suggesting a possible si- 
lencing phenomenon [29]. Data obtained from these 
plants were not taken into account for the current dele- 
tion analysis. However, in the lines selected for this 
study, transgene expression was stable during plant de- 
velopment and the pattern of GUS staining was similar 
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in the primary transformants and in the Rl generation 
indicating that silencing phenomena did not occur in 
the selected plant lines [9]. 

Different patterns of expression as well as signif- 
icant and reproducible differences in the staining in- 
tensities between promoter constructs could be clearly 
visualized despite some variation between individual 
transformants. It is likely that the reproducible differ- 
ences detected between promoter constructs indicated 
an effect of the deletion on the promoter regulation. 
We reported previously that the CsVMV promoter 
is expressed in all organs of transgenic plants, with 
regions of highest promoter expression located in vas- 
cular tissues, mesophyll cells of the leaves and the 
root tips [37]. Accordingly, for the present study, GUS 
activity was analyzed in these three different tissues 
(Figure 1). 

5 '-end deletions 

GUS staining in transgenic plants that contained the 
promoter constructs deleted to —330 and —222 (con- 
structs pB and pC) occurred in the same pattern and 
was in the same range of intensity as what was ob- 
served with the full length promoter (Figures 1 and 
2a). Further deletion of the promoter to -178 (con- 
struct pD) caused an important change in the GUS 
expression pattern (Figure 1). In leaf cross-sections of 
most of the plants carrying the pD construct, a strong 
staining restricted to the vascular tissues was observed 
(Figure 2). GUS staining was absent in the mesophyll 
cells of most of the plants. However, three plants lines 
out of ten presented a low staining in these cells. In 
all plants transformed with the pD construct, the root 
tips did not exhibit GUS staining (Figure 2i) while this 
tissue was intensely stained in plants containing the pC 
promoter construct. GUS activity from the promoter 
construct pE, deleted to —112, was restricted to the 
vascular tissues (Figures 1 and 2c). The intensity of 
the expression was very low and long incubation times 
were required to detect a blue precipitate in these tis- 
sues. The construct pF did not display any detectable 
expression (Figure 1). 

Internal deletions 

Internal deletion of the region from —334 to —222 
(promoter AB) did not affect the general expression 
pattern of the CsVMV promoter (Figure 1). A signifi- 
cant decrease in GUS accumulation was visualized in 
the spongy and palisade mesophyll cells of the plants 
transformed with pAC (Figures 1 and 2d). However 
the deletion did not suppress expression in vascu- 



lar elements and root tips, which exhibited a strong 
staining. Transgenic plants containing p AD 1 (deletion 
from —182 to —1 12) displayed a weak GUS staining 
restricted to the phloem cells (Figures 1 and 2e). Ad- 
ditionally, GUS expression was also observed in the 
root tips. The internal deletion encompassing -182 
to —63 (pADEl) had a profound effect on promoter 
activity (Figure 1). Of the 10 independent transgenic 
plants tested, 8 did not have any detectable GUS ac- 
tivity in the leaves or in stems. Very pale blue staining 
localized in the phloem cells was visualized in two 
plants after prolonged incubation (Figure 20- In con- 
trast, strong staining was revealed in the root tip as 
well as weaker staining in the vascular elements of 
the root. The construct pAE displayed a constitutive 
pattern of expression similar to the undeleted promoter 
(Figures 1 and 2g). 

These results emphasized the importance of the 
(—182/— 112) region for constitutive expression of 
the CsVMV promoter. We observed that the deletion 
pADl suppressed promoter activity in the mesophyll 
tissue and also diminished vascular GUS expression 
(Figure 2e). Consequently, we made the constructs 
pAD2 and pAD3 to investigate this region in greater 
details (Figure 1). The construct pAD2, deleted from 
nucleotides -173 to —1 12, displayed a profile of ex- 
pression similar to the full length CsVMV promoter 
(Figure 2g). This result suggested that the addition of 
9 nucleotides at the 5' end point of the (- 1 82/- 1 1 2) 
deletion could restore the full expression pattern al- 
tered with the construct AD1. No significant differ- 
ence was detected between the construct pAD2 and 
pAD3. 

In contrast, addition of the sequences —182/— 173 
and -182/- 149 to pADEl to make pADE2 and 
pADE3, respectively, did not lead to a restoration 
of promoter activity (Figures 1 and 2e), suggesting 
that elements located between —112 and -63 might 
influence promoter activity. 

Histochemical GUS analysis in young seedlings 

A histochemical GUS analysis on 10-day old Rl 
seedlings was carried out in order to analyze the ex- 
pression pattern of the deleted promoter at earlier 
stages of development. The deleted promoter con- 
structs directed specific expression patterns that were 
similar to those observed in adult plants. As in ex- 
panded leaves of adult plants, the constructs pD, pE 
and pADl conferred a vascular-specific pattern of 
expression in cotyledonary leaves (Figure 2h). Sim- 



ilarly, the pB and pC constructs which were active 
in both mesophyll and vascular tissues in tobacco 
plants had the same constitutive expression pattern in 
seedlings (Figure 2h). These results suggested that 
specific expression patterns observed with the differ- 
ent deletions in transgenic plants are not influenced by 
the developmental stage of the plant. 



GUS activity in leaf extracts of transgenic plants 

GUS activities in protein extracts prepared from leaf 
tissues were measured using a fluorometric assay [15]. 
The samples were collected from interveinal tissue of 
young expanded leaves from 5 -week old transgenic 
tobacco plants. Consequently, the GUS activity levels 
reflected mainly the promoter expression in mesophyll 
tissue. As shown in Figure 3, the values of GUS activ- 
ities in transgenic lines carrying a specific promoter 
construct varied by a maximal factor of 1 7. We did not 
observe a clear correlation between levels of expres- 
sion and transgene copy number (not shown) as plants 
with the same copy number exhibited large differences 
in levels of GUS activity. 

A significant decrease in level of GUS expression 
was detected in transgenic plants containing the pD 
promoter construct. A low level of GUS activity was 
also measured with the internal deletion pAC con- 
firming the importance of the region extending from 
-222 to -178. This result was consistent with the 
histochemical analysis since pD and pAC displayed 
low or no staining in mesophyll cells of transgenic 
plants. Similarly, low levels of GUS activity were 
measured for the constructs pADl, pADEl, pADE2 
and p ADE3 which had vascular-specific profiles of ex- 
pression in leaves of transgenic plants. The average 
level of activity measured with the construct pAD2 
was higher than with the pADl construct confirming 
that the region from — 1 82 to — 1 73 is involved in pro- 
moter activation. However, the pAD3 activity levels 
were in the higher range while the construct pAD2 
gave moderate levels of expression. This difference 
was not detected using the histochemical assay (Fig- 
ure 2g). As observed with the histochemical assay, 
the high expression level measured with the construct 
pAD3 was abolished by deletion of the region span- 
ning — 1 12 to —63 (construct pADE3). The construct 
pAE retained a high level of GUS activity suggesting 
that the promoter expression in green tissues is weakly 
affected by this deletion. 
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Expression of the CsVMV promoter deletions in 
protoplasts 

Protoplasts prepared from BY-2 suspension cells were 
transfected with the CsVMV promoter constructs. 
Transient expression of the uidA reporter gene was 
measured 24 h after electroporation in relation to 
an internal standard expressed from a cotransfected 
luciferase plasmid. Four independent transfection ex- 
periments were carried out; the results obtained are 
summarized in Figure 4. 

The construct pC which contains a CsVMV pro- 
moter deleted to position —222 retained 88% of the 
activity of the full-length promoter fragment. The pro- 
moter activity dropped sharply to only 24% of full 
activity with a further deletion to -178. Constructs 
pD and pE had almost the same expression level while 
a second drop of activity was observed with a deletion 
extending to —63. The construct pF with 12% of the 
full promoter activity was just above background level. 
An internal deletion from —228 to —178 (construct 
pAC) decreased expression by more than 50%. Sur- 
prisingly the constructs p AD 1 andpADEl which gave 
very low expression in transgenic plants allowed high 
levels of expression in protoplasts. The (—222/ — 1 78) 
region plays a critical role for promoter activity in pro- 
toplast while the region from — 1 78 to —63 appears to 
be of lower importance for expression of the promoter 
in protoplasts than in plants. 

DNA-protein interactions in the — 161 to —56 region 

A labeled DNA fragment encompassing - 161 to —56 
was used to detect specific binding with nuclear pro- 
tein extracts prepared from tobacco seedlings. Interac- 
tion of the (— 1 6 1 / — 56) probe with the nuclear extract 
generated a single complex of slower mobility than un- 
bound probe (Figure 5). The intensity of the retarded 
band increased with increasing amount of nuclear ex- 
tract in the reaction. The formation of the complex 
was reduced or abolished by addition of an excess of 
unlabeled probe whereas addition in the reaction of a 
an unrelated DNA fragment had no effect (Figure 5). 
This result suggested that one or more DNA-binding 
proteins in the tobacco nuclear extract interact with 
c/5-elements within the (— 1 6 1 / — 56) promoter region. 

To map more precisely the binding site of this 
interaction, we performed competition assays with 
sub-fragments of the probe. The complex was elimi- 
nated when overlapping sub fragments spanning — 1 6 1 
to -99, -141 to -56 or -141 to -99 were added 
to the reactions at 100-fold molar excess in the assay. 
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This suggested that a binding site for a protein factor 
is located between —141 and —99. 



Discussion 

This study showed that the CsVMV promoter has a 
complex structure made of discrete domains that exer- 
cise distinct influences on patterns of tissue specific 
expression. Our results suggest that the constitutive 
activity of the promoter is the result of additive and 
synergistic interactions between different promoter el- 
ements. The functional organization of the CsVMV 
promoter as defined by the deletion analysis is sum- 
marized in Figure 6. 

Promoter expression in green tissues is directed 
by the region extending from —222 to —173. In this 
region, we identified a consensus sequence asl [19] 
at -219/-203 as well as a GATA sequence at po- 
sition — 183/- 180. The construct pADl (deletion of 
the region from — 182 to — 1 12) exhibited a low level 
of expression (Figure 2e) and showed that as asl el- 
ement on its own cannot activate gene expression in 
mesophyll tissue. As suggested when we analyzed 
the expression patterns of the construct pADl and 
pAD2, the sequence extending from —182 to —173, 
which contains a GATA motif, is essential for pro- 
moter activity in green tissue. However, the specific 
role of this GATA region independent of the asl el- 
ement cannot be assessed based on expression of the 
constructs used in this study. Consequently, two hy- 
potheses are presented: either the GATA region, on 
its own, controls promoter expression in green tissues, 
or the GATA region and the asl element act together 
to control the CsVMV promoter activity in these tis- 
sues. Published data reported that a GATA motif in the 
CaMV promoter, referred to as the activating sequence 
2, as2 [20], is also involved in leaf expression. More- 
over, gene expression in leaf tissues controlled by the 
GATA region in the CaMV promoter was dependent 
on sequences located within the —90 to —46 region 
(which contains as asl element). Similar interactions 
may control expression of the CsVMV promoter in 
green tissues. However, the GATA motif identified 
in the CsVMV promoter is not identical with the as2 
element of the CaMV promoter. We found a greater 
sequence homology with a GATA box identified in the 
rice tungro bacilliform badnavirus promoter [5, 42] 
which plays an important role in the activation of that 
promoter [43] and with a box I element (GATAAPu) 



found in several light and circadian-clock-regulated 
promoters [8, 36]. 

A low level of GUS expression was observed in 
mesophyll cells of plants transformed with the pro- 
moter construct pAC which deleted the asl and GATA 
regions (Figure 2d). It is unlikely that this residual 
GUS staining in green tissue is due to additional minor 
elements located in the region from -443 to —228. 
Indeed, this region in the context of the deletion p AD 1 
did not direct any GUS expression in mesophyll cells 
(Figure 2e). The GUS staining in mesophyll cells of 
plants containing pAC is due either to diffusion of 
the dye from intensely colored vascular tissues or to 
the new arrangement of sequence created by the dele- 
tion that could be favorable to gene expression in the 
mesophyll cells. 

The results of the studies in transgenic plants 
showed that expression of the CsVMV promoter in 
vascular tissues is directed by the region encompass- 
ing — 178 to —63 and is not dependent upon the 
presence of as 1 or the GATA region (as shown by 
the construct pD, Figure 2b). The latter elements 
may have a moderate effect on vascular expression as 
suggested by the construct pADE2 and pADE3 (Fig- 
ure 2e) but is clearly not required for full promoter 
activity in vascular tissues. The expression pattern dis- 
played by construct pE showed that a vascular domain 
is located in the — 1 12/— 63 region. We detected in this 
region a 22 nucleotide sequence containing a CTTATC 
repeat previously identified in the Com YM V promoter 
[23]. However, the results obtained showed that ad- 
ditional elements located between —178 to —1 12 are 
responsible for the strong vascular expression (Fig- 
ure 6). Within the latter region, important elements 
may be contained between — 149 to — 1 1 6 as suggested 
by comparing the expression pattern of pADE3 and 
pAE. 

In vitro binding assays performed with the region 
— 161 to —56 revealed a specific interaction with nu- 
clear protein. We detected a single retarded band in 
these assays, the formation of which was efficiently 
disrupted by competition with a 43 nucleotide frag- 
ment extending from —141 to —99. This was some- 
what surprising since regions from — 149 to — 1 1 2 and 
from —112 to —63 play an active role in expression 
of the promoter. It is possible that certain binding ac- 
tivities are not detected due to a low concentration of 
transcriptional factors in our nuclear extract, to a lower 
affinity for the binding site, or because cooperative 
binding with other factors is required. 
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|asl|GATA| |GTAA | CTTATC | TATA | 



-443 



-222 -182 -173 -149 -112 



-63 




A. Mesophyll tissues 



B. Root tips 



] C. Vascular tissues 



Figure 6. Summary of the Cs VM V promoter regions required for uidA gene expression in various tissues of transgenic tobacco plants. Coor- 
dinates relative to the transcription start site indicate the position of promoter domains as denned in this study. Promoter domains required for 
expression in mesophyll tissues (A), in root tips (B) and in vascular tissue (C) were identified and are indicated below the map. A. An essential 
promoter region for expression in mesophyll tissues (hatched grey box) is located between -222 and -173. This region contains a consensus 
asl element [19] and a GATA motif. The hatched boxes from - 149 to - 1 12 and from - 1 1 2 to -63 represent promoter regions that could not 
specify mesophyll expression by themselves but that were required in combination with the upstream hatched grey box to direct expression in 
green tissues. These putative synergistic interactions are represented in the figure by the arrows. B. Two promoter domains (hatched grey boxes) 
from -443 to -222 and from -222 to - 178 could direct gene expression in root tips. Our data suggest that these two domains have a redundant 
function. C. The region extending from — 149 to - 1 12 (hatched grey box), characterized by a GTAA repeat, was responsible for high level of 
gene expression in vascular tissues. The region extending from - 1 12 to —63 (open box) was also able to specify vascular expression but could 
not direct high level of gene expression. This region contains a CTTATC repeat previously identified within the ComYMV promoter sequence 
[23]. Formation of a complex between the promoter region extending from —141 to —99 and a protein factor present in tobacco nuclear extract 
is indicated by an open circle. 



More detailed examination of the sequence of this 
43 nt fragment revealed the presence of a GTAA re- 
peat located at -129 to -1 13. The upstream GTAA 
motif of the latter repeat resembles the core-sequence 
of the P-box identified in the zein gene promoters [38]. 
The tef\ box [32] located in the -100 region of the 
EF-la gene promoters contains a GTAA repeat with 
similarities to the GTAA box of the CsVMV promoter. 
Determination of the role of the GTAA repeat in the 
CsVMV promoter will require further study. 

Deletion of regions of the CsVMV promoter that 
are important for expression in vascular tissues, as in 
the construct pADE3, suppressed gene expression in 
green tissues. Expression in the mesophyll tissue was 
restored by the addition of nucleotides -419 to —1 16 
(construct pAE) or by the addition of nucleotides 
- 1 1 2 to -63 (construct p AD3). These results could 
be explained by an effect due to spacing between the 
'mesophyll domain' (namely, the asl element and the 
GATA motif) and the TATA box in construct pADE3. 
By contrast, the promoter is active when at least 49 
nucleotides are present between the GATA element 
(-182 to -173) and nucleotide -63. 

In the CaMV 35S promoter, the asl and GATA 
motifs are located between —64 and — 105 [20], much 
closer to the TATA box than in the CsVMV pro- 
moter. By analogy, the shorter distance between the 
GATA region and TATA box created by the pADE2 
and pADE3 internal deletions would not be expected 
to prevent the activity of the as 1 and GATA cis ele- 



ments. The data may suggest that additional exacting 
elements contained in the region —149/— 112 and in 
the region - 1 1 2/— 63 are required for promoter activ- 
ity in green tissues (Figure 6). Elements contained in 
these regions may act in synergy with the 'mesophyll 
domain' of the CsVMV promoter to promote gene 
expression in green tissues. 

The CsVMV promoter includes an important reg- 
ulatory domain for expression in root tips between 
-222 and -178 (Figure 2i). It is likely that the asl 
element contained in this region is involved in this 
function, based on analogy with the rcjle of the asl 
element of the 35 S CaMV promoter in root expres- 
sion [1, 11, 19]. However the construct pAC, which 
lacks the asl element, is strongly expressed in root 
tip, and indicates that other promoter-elements are also 
involved in root tip expression (Figure 6). 

In BY2 protoplasts, CsVMV promoter activity ap- 
peared to be controlled by the region encompassing 
—222 to —178 that contains the asl consensus se- 
quence. Similar results were obtained with leaf meso- 
phyll protoplasts (not shown). It was surprising to find 
that expression from the promoter is independent of 
sequences from — 1 82 to —63. Discrepancies of results 
between protoplast-based transient assays and trans- 
genic plants were also observed for the 35 S CaMV 
promoter [10, 18]. Protoplasts are in highly stressed 
physiological state [33] that could be responsible of 
activation or inactivation of various /nms-acting fac- 
tors interacting with the promoter. In this regard, 
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several reports on the responsiveness of the asl ele- 
ment to multiple stress-related signals such as auxins, 
salicylic acid, methyl jasmonate are particularly rele- 
vant [22, 31, 41, 44]. Interestingly, the activities of the 
promoter mutants in protoplasts correlates with their 
activities in root tips (Figures 1 and 4) suggesting that 
a similar array of /raws-acting factors is present in root 
tips and in protoplasts. 
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